A discriminant measure for model complexity adaptation

نویسندگان

  • Lalit R. Bahl
  • Mukund Padmanabhan
چکیده

1 ABSTRACT We present a discriminant measure that can be used to determine the model complexity in a speech recognition system. In the speech recogition process, given a test feature vector the conditional probability of the feature vector has to be obtained for several al-lophone (sub-phonetic units) classes using a gaussian-mixture density model for each class. The gaussian-mixture models are constructed from the training data belonging to the allophone classes, and the number of mixture components that are required to adequately model the pdf of each class is determined by using some simple rule of thumb { for instance the number of components has to be suucient to model the data reasonably well but not so many as to overmodel the data. A typical example of the choice of the number is to make it proportional to the number of data samples. However, such methods may result in models that are sub-optimal as far as classiication accuracy is concerned. In this paper we present a new discriminant measure that can be used to determine in an objective fashion, the number of gaussians required to best model the pdf of an allophone class. We also present the results of experiments showing the improvement in recogntion performance when the number of mixture components is chosen based on the discriminant measure as opposed to the rule of thumb. These results are presented both for the speaker-independent and speaker-adapted case. 2 INTRODUCTION We present a discriminant measure that can be used to determine the model complexity in a speech recognition system. In the speech recognition problem, feature vectors are extracted periodically from the input speech and are matched to diierent sequences of phones, that represent words in the vocabulary. In the statistical approach to speech recognition, this is done by estimating the probability density of each phone in the feature space from the training data, and using these pdf's to assign a probability to a test feature vector. The most common case is where a parametric model is used to model the pdf, with the parametric model generally being a mixture of gaussian distributions. Hence, in the speech recogition process, given a test feature vector the conditional probability of the feature vector has to be obtained for several allophone (sub-phonetic units) classes using the gaussian-mixture density model for each class. It is not unusual to use tens or even hundreds of thousands of diierent …

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sample-oriented Domain Adaptation for Image Classification

Image processing is a method to perform some operations on an image, in order to get an enhanced image or to extract some useful information from it. The conventional image processing algorithms cannot perform well in scenarios where the training images (source domain) that are used to learn the model have a different distribution with test images (target domain). Also, many real world applicat...

متن کامل

Amputation effects on the underlying complexity within transtibial amputee ankle motion.

The presence of chaos in walking is considered to provide a stable, yet adaptable means for locomotion. This study examined whether lower limb amputation and subsequent prosthetic rehabilitation resulted in a loss of complexity in amputee gait. Twenty-eight individuals with transtibial amputation participated in a 6 week, randomized cross-over design study in which they underwent a 3 week adapt...

متن کامل

Fuzzy Complexity Analysis with Conflict Resolution for Educational Projects

Evaluative and comparative analysis among educational projects remains an issue for administration, program directors, instructors, and educational institutes. This study reports a fuzzy complexity model for educational projects, which has two primary aspects (technical aspects and transparency aspects). These aspects may not be measured precisely due to uncertain situations. Therefore, a fuzzy...

متن کامل

A Novel Method for Detection of Epilepsy in Short and Noisy EEG Signals Using Ordinal Pattern Analysis

Introduction: In this paper, a novel complexity measure is proposed to detect dynamical changes in nonlinear systems using ordinal pattern analysis of time series data taken from the system. Epilepsy is considered as a dynamical change in nonlinear and complex brain system. The ability of the proposed measure for characterizing the normal and epileptic EEG signals when the signal is short or is...

متن کامل

A Model for an Adaptive University

With the increasing complexity and chaos of extracurricular higher education environments in diverse ecosystems, university adaptation to the environment as a social and activist system has become an inevitable necessity. Therefore, this study aims to analyze the content of articles compiled in the context of the University of Adaptation in internal and external research to present the Adaptive...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998